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Abstract 

For a given class T of closed sets of a measured metric space {E,d,fi), we want to find 
the smallest element B of the class J- such that /i(-B) > 1 — a, for a given 0 < a < 1. This 
set B localizes the mass of /i. Replacing the measure fi by the empirical measure fin gives an 
empirical smallest set Bn- The article introduces a formal definition of small sets (and their 
size) and study the convergence of the sets Bn to B and of their size. 
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1 Introduction 

The framework of our study is a measured metric space (E, d, /r). Mass localization intends to find 
in this setting a small Borel set B such that n{B) > 1 — a for some given 0 < a < 1. The measure 
H conditioned on i? is a new measure that we say to be a-localized and denote This article 
provides a definition of a smallest Borel set of probability 1 — a in order to obtain a localized version 
of the measure with the smallest support possible. 

This smallest Borel set represents intuitively the "essential part" of the measure. However, it 
seems difficult to give an universal definition of ’smallest": although a ball centered on the origin 
as the smallest set with standard Gaussian measure on seems a good choice, it is not obvious to 
define such set if the measure is not unimodal or if it is not symmetric or if it is not even defined 
on an Euclidian space. 

Consistency is an important property we want for our notion. In statistics, the measure /r often 
unknown, is usually approximated by a sequence of probability measures {^in)n>i- The smallest 
closed set with -probability 1 — a should become closer to the smallest one of /^-probability 1 — a 
as n grows. 

Several methods have been studied in order to define such sets. 

A first method is to choose a class E of subsets of E partially ordered by their volume and to 
pick the smallest set (for this order) of this class with a /^-probability greater than 1 — a. This 
set corresponds to the level sets of a density function / whenever /t is absolutely continuous with 
respect to the Lebesgue measure and the class E contains the level sets. An other way to define 
this set is to maximize 

^i{B)-pXiB), (1) 

over B G E, where A is the Lebesgue measure and /t({/ > /3}) = 1 — a. This notion is known as 
excess mass. Denote by Bp the maximizer of m and by B^ the maximizer of 

/r„(H) -/3A(H), 

for /i„ the empirical measure. It is then of interest to determine if Bp converges to Bp and to 
exhibit a rate of convergence in this case. 

The article |Har87j considers the case of E being the set of all convex sets of and proves that 
the Hausdorff distance dniBn, B) between B^ and Bp converges to 0 and satisfies 

dH{BlBp)=o(^-^'^ ' . 

The article |Nol91j considers sets E as the set of all ellipsoids. Consistency of B'^ is proven, as well 
as the following limit theorem. Let c„ and c be the centers of the ellipsoids B’p and Bp respectively, 
and let (T„ and a be the vector containing the entries of the matrix defining the ellipsoids B"^ and 
Bp respectively, then, if the level sets of the measure /t are ellipsoids, 

n^^^{cn - C,(T„ - cr) 
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is weakly converging to the maximum of a Gaussian process. [Pol97| studies a more general case, 
with a different notion of convergence, and showed the consistency of B'p for the pseudo-distance 

d^{F,G)=fi{FAG), 

where A denotes the symmetric difference, whenever the class is a Glivenko-Cantelli class. Under 
several hypotheses including that the level sets of the measure belongs to F and regularity 
conditions on the article obtains the following rate of convergence 

for a constant S depending on the regularity of /r. This excess mass approach leads to rather precise 
results in many cases. However, it comes with few drawbacks, such as the condition that F must 
contains the level sets of the unknown measure which requires a certain knowledge on the /i. 
Requirements on the regularity of fj, can also be unsatisfactory for some applications. Also, this 
approach is restricted to the case of spaces with finite dimension (and often R*^). 

A second method comes from the notion of trimming on R extended to R^*. On R, the smallest 
set of /r-probability 1 — a is defined as 

[F-i(a/2);F-i(l-a/2)], 


where F is the cumulative distribution function of fj,. Replacing F by the empirical cumulative 
distribution function Fn defines the empirical smallest set. Extension to R'^ can be done in the 
following way: Ca denotes the intersection of all the closed half spaces of /r-probability greater than 
1 — a. Ga is then a non-empty convex set for a < 1/2, if the measure /r is regular enough. |Nol92| 
deals with the rate of convergence of Gn , defined similarly with the empirical measure /in and shows 
its consistency. In order to quantify the rate of convergence of G„ to Ca, the article introduce the 
following random functions 

r„(u) = inf{r > 0;ru ^ C„}, 


and 


ra(u) = inf{r > 0;rit ^ Ca}, 

and establishes the weak convergence to a Gaussian process defined on the unit sphere of the 
process 

Vnirn -Ta), 


under regularity conditions on the density function of /x. 

The article |GAGM9'7] presents another method, called a-trimmed fc-means, which introduces 
very few arbitrary parameters. This method chooses the support of the a-localized measure u as 
the one minimizing the distortion to its best fc-quantifier. Formally, for a given function <I>, and a 
given integer k, the method consists in choosing 


Sargmin'l inf / $( inf \\X — rmW] du; u(B) > 1 — a> . 

After proving the existence of such minimizer, the article [GAGM9'7] shows the consistency of Ba'- 
if itJ'n)n>i weakly converges to an absolutely continuous measure /x then, for any choice of 


B}} e argmin 


inf 


$ 


inf \\X - rriiW ] dfin, f^n{B) > 1 - a 

l<i<k 
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the sequence (i?2)™>i converges to (when unique) for the Hausdorff metric. Theses results hold 
on 

The main goal of our article is to provide a new definition that is intuitive and avoid usual 
hypotheses, that remains consistent. 

1.1 Definitions 

We define a notion of smallest closed set and introduce some properties that will help to understand 
its meaning. The framework of the definition aims to be fairly general. {E, d) is a Polish space 
(metric, separable and complete space) and fi is a Borel measure on (E,d). A smallest set will be 
defined as the minimizer of a function r defined on a class E of closed subsets of E. 

1.1.1 Stable set 

In order to ensure the existence of the smallest set in a class E of sets, the class needs to be stable 
in some way. The following definition of such stability will be an assumption made on the class. 
Let first set the following notation. 

For a given set B and e > 0, the set B^^ is the e-neighborhood of B: 

:= {x € A; G B, d{x, y) < e}. 

Definition 1 (Stable set). Let {Bn)n>i be a sequence of closed sets, denote lim„ the set 

limB„ := n u n 

£>0 fe>l n>k 

Let E be a class of closed sets of E. E is stable if E € E and 

iBn)n>i C E 3(nfc)fc>i,nfe-J-oo,lim5„^ G J". 

k 

This notion of stability is close to the completeness under Hausdorff convergence. Indeed, it is 
strictly equivalent if the metric space {E,d) is compact, as it will be discussed in the next remarks. 

As we defined as a subset of the closed sets of {E, d), we first check that our notion of stability 
makes sense for a class of closed sets. 

Remark 2. Given a sequence of sets {Bn)n>i, lim„ is always closed. Indeed, denote B(x,ef2) 
the ball centered in x of radius e/2, 

a; ^ limR„ 3e > 0,Vfc > l,3n >/c,x ^ (2) 

n 

3e > 0, Vfc > 1, 3n > k, B{x, e/2) fl = 0 (3) 

3e > 0, B(a:, e/2) n limB„ = 0. (4) 

n 

In other words, (lim„ BnY is open, and lim„ Bn is thus closed. 

The following remark aims to clarify stability. 
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Remark 3, When {Bn)n>i is converging to Boo for the Hausdorff metric, then 

lini Bji — Boo • 
n 

Indeed, denote the smallest e > 0 such that C and C i?® for all n > k, then, 

^»=nun^-^nu 

£>Ofc>ln>fc e>Ofc>ln>A: £>Ofc>ln>fc 

In a more general setting, given a sequence of closed balls {Kk)k>i such that Uk>iKk = E, and 
given a sequence (Bn)n>i, if there exists Boo such that for any k > 1, the sequence {Bn fl Kk)n>i 
converges in Hausdorff metric to Boo H Kk, then 

lim Bn — Boo • 

n 

Remark 4. In a metric space {E,d) such that every bounded closed set is compact (this is the case 
for instance, of locally compact length spaces), it is easier to understand the meaning of stability 
of a class. For any sequence {Bn)n>i of closed sets of E, and any sequence {Kk)k>i of increasing 
closed balls such that Uk>iKk = E, there exist a set Boo cind a subsequence (relabeled {Bn)n>i) 
such that Bn H Kk converges in Hausdorff metric to Boo cind 

lim Bn — Boo • 

n 

In this case, a stable class in the sense of definitions^ is just a compact class for the Hausdorff 
convergence on large balls. Indeed, in such spaces E, there exists an increasing sequence of com¬ 
pacts {Kk)k>i such that Uk>iKk = E, take for instance a sequence of balls centered on the same 
point, with an increasing radius. The Hausdorff convergence on large balls is then equivalent to the 
Hausdorff convergence of {Bn r\Kk)n>i for any k G N. Since the closed sets in Kk forms a compact 
class for the Hausdorff convergence, there exists a subsequence of {Bn H Kk)n>i converging to some 
B^. Using diagonal argument, we may extract a subsequence of the original sequence {Bn)n>i such 
that for any k G N, {Bn H Kk)n>i converges in Hausdorff metric to B^. It is easily checked that 
Boo '■= is a limit of a subsequence of {Bn)n>i, In the sense of definition[J\ 

Let us introduce some examples of stable sets. 

Example 5. Take T as the set of all closed sets. Stability is then obvious since the limit considered 
in the definition of a stable set is always closed as shown in the remark\^ 

Example 6. The set of all balls is generally not stable, but is does not take much to make is stable. 
The set of all closed balls and half spaces in is a stable class. This assertion can be proved using 
parametrization of the center of the balls in spherical coordinates and using compactness of spheres. 

Example 7. Other shapes of sets ofW^ make stable classes. Ellipsoids, rectangles, or convex bodies 
with bounded diameter (by some fixed R < oo) all form stable classes. And it is possible to get rid 
of the bounded diameter by adding some sets to the class. 

Example 8. If F is a stable class of convex sets of a metric space {E,d) such that closed balls are 
compacts, then 

Fe ■= {'JFGgE;G C F,\/F,G G G inf d{x,y)>e}, 

x&F,yGG 

is also a stable class (see lemma f^l . 
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1.1.2 Size function 

As we aim to define a smallest set of the sets of T, we need to define a notion of size. This is done 
using a function r, meant to measure the size of a set. In order to localize the mass, we will thus 
minimize the size of a set, among all sets given a probability measure. 

In order to express our assumptions on r, we first define the Hausdorjf contrast. 

Definition 9 (Hausdorff contrast). Let A and B be two closed subset of a Polish space {E, d). The 
Hausdorff contrast between A and B is defined by 

Haus(A|i?) := inf{e > 0|A C B®}. 

We can then remark that the Hausdorff metric dniA^B) between two closed sets A and B is 
then 

dniA, B) = Haus(A|i3) V Haus(i?|A). 

We now define formally a size function. 

Definition 10 (Size function). Let {E,d) be a metric space. A function t : E K."'" is called a 
size function if it satisfies the three following conditions: 

(HI) T is increasing, i.e. A G B '^(^) ^ '’'(B), 

(H2) for any decreasing seguence (A„)„>i C E such that t{Ai) < oo and Haus(A„| fl/c Ak) — t 0, 
the following holds r(A„) —>■ T(n„A„), 

(H3) for any sequence (A„)„>i C E, r(lim„ A„) < lim inf„ r(A„). 

Hypothesis (m|) on the size function requires some Hausdorff contrast. This particular choice 
make the hypothesis weaker and allow the hypothesis to hold for size function that give finite size 
to non compact sets. The consequences of these hypothesis will be more detailed in the sequel of 
the paper. 

1.2 Overview of the main result 

Our main result states that under the condition (HID, (I® and m, for the empirical measure 
Pm and a stable class E, 


r“ < lim inf t“ < lim sup t“ < lim ®, 

n „ £ 

where = min{T(i?); B € E, Pn{B) > 1 — a}. 

It implies the convergence of t“ when . i—>■ t' is continuous. 

The result actually holds for a wider class of sequence of measures {pn)n>i- 
Moreover, simple conditions on the sequence imply the convergence of the minimizers of the t“ 
for different metrics (depending on the conditions assumed). This is discussed in the next sections. 
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2 First properties 


2.1 Existence 

Let us recall the setting. {E,d) is a Polish space and /x is a Borel probability measure on {E,d). 
Given a size function r, a stable class E of closed sets of E, and a level a, we define the support 
B°‘ of the a-localized measure of /r by - when possible: 

B°‘ G argmin{r(A); A G E, n{A) > 1 — a}, 


and set 

Our first concern is whether B°‘ exists. It is the matter of the next result. 

Theorem 11 (Existence of a minimum). Let {E,d) be a Polish space, E a stable elass and p, a 
probability measure on {E,B{E)). Set 0 < a < 1. Suppose Then, there exists B € E such 

that 

B G argmin{r(A); A G E{E), fJ.{A) > 1 — a} . 

Remark 12. Hypothesis can not just be omitted. Indeed, if t{B) is defined as the Lebesgue 

measure of the closure of B on take 

H = a'jd + (1 - a)q, 

where q is a probability measure supported on and jd is the standard Gaussian measure on 
then, the sequence {Bn)n>i defined by 

En {Xk\l<k<n 0 B(0, rjif 

with {xn]n>i = o.nd rn —>■ 0 so that fi{Bn) = 1 — a, is a minimizing sequence. And T{Bn) = 
r (i?(0, r„)) so that r“ = 0 but r (Q'^) = +oo. 

The minimizer is not necessarily unique. This seems natural with the following example. Take 
p, as the uniform law on the unit square and an isometric r. Then any translation small enough 
of the minimizer will have the same size and the same measure, and will thus be another mini¬ 
mizer. Another result fcorollary 1251) will comfort us proving that minimizers form a compact set 
for Hausdorff metric. 

The stability condition on E is needed for existence of the minimum. However, it can be lightly 
weakened. 

Remark 13 (On stability of E). Sinee the minimal size min{T(A);A G E,pLn{A) > 1 — a} is 
bounded if t“ = min{T(A); A G E, p(A) > 1 — a} < oo, then we may suppose instead of stability of 
E that all the classes 

E^ := J'n{A;r(A) < M} 

for M < oo are stable. It is a weaker notion since T(hmi3„) < lim inf T(i?„) for any sequence 
{Bn)n>i in E, under (I^. 
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2.2 Regularity of r 

Denote 

r“ = inf{T(A); A € T, fJ.{A) > 1 — a, B}. 

It seems natural to expect a i—^ r“ to be continuous when /i is regular enough. It also seems natural, 
for instance, to have B°‘ growing continuously when a decreases to zero, for a unimodal measure 
This is the concern of this paragraph, the first one establishing the right continuity. 

Proposition 14 (Right continuity). Let {E,d) be a Polish space, p, a probability measure on 
{E,B{E)) and T a stable class. Let 0 < a < 1. Then, under (I{^, a i— >■ t“ is right continu¬ 
ous. 

The continuity will require some more hypotheses as shows the following example of discontinu¬ 
ity. Take p, = (Sj, -j- 5y)l2 and a = 1/2, and it is not difficult to find some r that is not continuous 
on a. 

Thus, it is clear that continuity property of this function needs regularity on the measure we 
want to localize, with respect to the class E. This is why we introduce the notion of J^-regularity. 

Definition 15 (J^-regularity). A probability measure p. is said to be E-regular if for all B G E, 
any <5 > 0 and any C € E such that B C C and p{B) < p{B^ fl C), there exists A G E such that 

Ac B^nc, 
p{B) < p{A). 

The only purpose of this notion is the continuity of the application a i—>■ r“. It is restrictive on p 
only when E is not rich enough. Taking E as the class of all closed sets of E make any probability 
measure J^-regular. Indeed, since p{B^ fl C) = lim„fl C), there exists n > 1 such that 
p{B) < p{B^~^^'^ n C) and then we can choose A := B^ (1 C. On the other hand, if E is not rich 
enough so that t(E) is not even connected, it is easy to build a measure p that is not J^-regular. 

Proposition 16 (Continuity). Let p be a probility measure on a Polish space (E,d). Suppose (I^, 
and (IW^, and that p is E-regular, has a connected support and that r“ is finite for any a > 0 
then, the mapping a i—>■ r“ is continuous. 

Remark 17. The condition r“ < oo just avoids a degenerated case. 

This continuity condition is a first step toward the main matters of our article, the consistency. 


3 Consistency 

3.1 r-tightness 

In order to show the consistency of the mass localization when a sequence of measures {pn)n>i 
converges to a measure p, we must make some assumptions on the sequence of measures. The first 
and most important hypothesis for consistency is the r-tightness. 

Definition 18 (r-tightness). A sequence of random probability measures {pn)n>i almost surely 
weakly converging to a measure p is t- tight if for any <5 > 0 and any B G E such that t{B) < oo, 


almost surely, for any C € T such that B C C and yL{B) < liminf„ yLn{C), there exists A ^ F such 
that 


h'iB) < lim inf 

n 

B c Ac B^ nc, 
t{A) < oo. 

An important remark on this definition is that a r-tight sequence of random measures does 
not have necessarily almost surely r-tight realizations. This can happen to empirical measures for 
instance. This subtlety lies in the position of "almost surely" in the definition, that is, after the 
choice of B and 5 made. 

We can also remark the following. Inequality yL{B) < lim inf„ /i„(C) is not a consequence oi B C 
C. Indeed, the portmanteau theorem states limsup„ p,„(C') < /r(C') and lim sup„< ^J.{B) 
siirce B and C are closed. The conditions for r-tightness on B G F such that fi{B) = lim„ fJ,n{B) is 
clearly verified for A := B. The definition of r-tightness can be understood as follows. Whenever 
{fJ'n)n>i does not catch all the /r-mass of B (i.e. lim inf„< n{B)) but some set C that 
contains B has its /i-mass well caught (i.e. yi.{B) < lim inf„ /r„(C')), then F must have an element 
A that also have its ^-mass well caught (i.e. fJ.{B) < liminf„/r„(A)), of finite size (i.e. r(A) < oo) 
and that is stuck between B and a ^-neighborhood of B intersected with C, for small S. 

The following proposition states that this notion is not empty, and includes the empirical mea¬ 
sures. 

Proposition 19 (r-tightness of the empirical measure). Let fx be a probability measure on E such 
that r“ < oo for 0 < a < 1. Let {Xi)i>i be a sequence of i.i.d. random variables with common law 
Set ' Lhen, (//n)n>i i^ T-tiqht. 

The empirical measure is actually not the only simple example of r-tight sequence. The following 
corollary gives a simple condition for a sequence of random probability measures to be r-tight. 

Corollary 20. Let {fJ.n)n>i be a sequence of random probability measure on E almost surely weakly 
converging to some measure p, such that r“ < oo, for any 0 < a < 1. If for all B G F, almost 
surely, p{B) < lim„/x„(i3), then {pn)n>i is r-tight. 

This corollary says that r-tightness is implied by almost sure convergence of Pn{B) for each 
B G F and thus, dropping the "almost sure" makes the r-tightness much more restrictive. 

We can now state our first result on consistency. 

3.2 r-consistency 

Our goal is to show that when /i„ converges to p, the size r“ of the smallest element of a given 
class F with pn-mass at least 1 — a converges to the size r“ of the smallest of p-mass at least 1 — a. 
In other words, we want to prove consistency of the smallest size r“. The following theorem states 
conditions for this consistency to hold. 

Theorem 21 (Consistency). Let {E,d) be a Polish space, F a stable class and {pn)n>i o, r- 
tight sequence of random probability measures on {E,B{E)) almost surely weakly converging to 
some measure p. Set 0 < a < 1. Choose any 5“ G argmin{r(A); A G F,Pn{A) >1 — 0 :} and 
dn — for all n > 1. Then, under hypotheses (ffQ]), (If^ and (I^, the sequence {Pn)nGn 
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is almost surely totally bounded for the weak convergence topology and := linifc Bn^. along any 
converging subsequence {p,nk)k>i of {fJ,n)n>i satisfies > 1 — a and almost surely 

t“ < r(S“ ) < liminf t(B“) < limsupT(B“) < lim 

n—>-oo e—>-0+ 

Moreover, if pL is B-regular and its support is connected, the five terms above are equal. 

Note that the r-tightness condition is required only for the last inequality. 

It is rather clear the if a i—>■ r“ is not continuous for the measure p, we can hardly expect 
consistency of the smallest size r“. This first step of consistency brings us to consider consistency 
of the smallest set of the class itself. 

3.3 Minimizer consistency 

The smallest set in J- with ^-mass greater than 1 — a is not always unique, and therefore consistency 
does not just mean that minimizer for /i„ converges to the minimizer for p. In order to give a sense 
to consistency, we will consider the set of all minimizers and the Hausdorff contrast between sets of 
elements of J- (for some underlying metric djr on T). We thus first recall the definition of Hausdorff 
contrast. Let A and B be two sets. The Hausdorff contrast between A and B is defined by 

Haus(H|i?) := inf{e > 0|H C B^}. 

Let us denote, for 0 < a < 1, a sequence of measures {pn)n>i and a measure p; 

5“ = argmin{r(H);H e B,Pn{A) >1-a}, 


and 

5“ = argmin{T(H); H £ T, p{A) > 1 — a}. 

The sets S°‘ and 5“ are thus two subsets of J-. What we want is to find conditions under which 


Haus(5“|5“) ^ 0, 


when n tends to infinity. 

We now state and comment briefly the two hypotheses that will be made for our main result. 
(H4) • I—r' is continuous at a, 

(H5) 'i{An)n>i C F such that r(lim„H„) < oo, 

limr(H„) = T(limH„) djF(A„, limH^) 0 


where djr denotes a metric on F. A typical example of such metric is the Hausdorff metric or 
the measure of symmetric difference. Section [4.21 is devoted to these examples and conditions that 
imply (1^5]). 

The continuity condition (LJT]) is a consequence of the proposition 1161 a connected support for 
an J^-regular measure suffices. 

We can now state a direct consequence of theorem Wf \ and hypotheses (HD) and (HD- 
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Theorem 22 (Consistency of the minimizers). Let {E, d) be a Polish space and {p,n)n>i a sequence 
of random r-tight probability measures on {E,B{E)) almost surely weakly converging to some T- 
regular measure /i. Set 0 < a < 1. Equip T with a metric djr 
Suppose (I^ (for the measure yt), and (If^. 

Then, almost surely, 

Haus(5“|5“) ^ 0. 

We now make some remarks on the necessity of the hypotheses. 

Remark 23 (On (IBI))' Continuity condition on t' at a can not be dropped. Indeed, if we 
choose dpLn = (l[o;i-a-i/rt] + l[ 2 ; 2 +a+i/rt ])<^'^7 then, taking for instance T{B) = A4{B,t)tdt, yields 
to the minimizers Supp = [0; 1 — a — 1/n] U [2; 2 + 1/n] and Supp = [0; 1 — a]. 

The result is stated with Hausdorff contrast and not Hausdorff metric. One can wonder what 
happens with the Hausdorff metric. 

Remark 24 (Convergence with Hausdorff metric). The convergence of Haus(iS“|5“) (i.e. the 
other contrast), is not always true. Take for instance t{B) = Ai(B,t)tdt (see definition \lI8\) and 
dpLn = fndX on K, with fn{x) = \ x/n for —1/2 < x < 1/2 and f{x) = 0 otherwise. Then, the 
minimizer [—1/2; 1/2 — a] for fi = l[o;i] is not included in any e-neighborhood of [—1/2 + a; 1/2]. 

When 5“ is a singleton, theorem|22|then states that any converges to the minimizer for the 
limit pL. When S°‘ is not a singleton, it states that all in Sf gets close to an element of S°‘, 
uniformly. However, the remark |M| precises that there could be some elements of S°‘ that will not 
be approximated. 

We can derive from the proof of this result the following corollary. 

Corollary 25. Let {E,d) be a Polish space and p, a probability measure on {E,B{E)) such that 
r“ < oo. Set 0 <a <1. Suppose (I{^, (I{^ and (I{^. Then argmin{r(H); H S T,p{A) > 1 — a} 
is compact for the Hausdorff metric topology. 

The tools developed in the proofs of these results were also effective to prove some continuity 
on the minimizer on the level a. 


3.4 Minimizer continuity 

The proof of the theorem of the minimizer consistency is based on the two lemmas [301 and |37| The 
same technique of proof leads to the following result. 

Proposition 26. Let {E,d) be a Polish space and a E-regular probability measure p on {E,M{E)). 
Set 0 < a < 1 and (a„)„>i converging to a. Suppose (If^, (IW})> (I^, ond (If^. Then, 

Haus(5“'‘|5“) ^ 0. 

In the case of a unique minimizer for all a, this result states the continuity of the function that 
associate a to the minimizer for a: 
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4 Examples 

In this section, we introduce examples of stable classes and size functions, in order to show the 
scope of our results. 

4.1 Examples of stable classes 

4.1.1 Closed sets 

The simplest example of stable class is the set of all closed sets. Indeed, given a sequence of closed 
sets {An)n>i, the limit lim4„ is also closed (see remark [2|). 

4.1.2 Parametrized classes 

On R'^, the set of closed balls and half spaces is also a stable class. Indeed, each ball can be 
parametrized by (x,r), the center and radius of the ball, and half spaces are limit (in the limit of 
sets we defined) of balls. From this parametrization, one can show that there exists a converging 
subsequence to any sequence of balls and half spaces in the sense we defined for the limit of sets. 

More generally, one can use the remark [T3] which states that for a given size function r, the 
condition on the class to be stable can be weakened to the condition that = J- C\{B\t{B) < 
M} is stable for each M £ R+. Then, taking any parametrized class J- such that convergence of 
the parameters implies convergence of the sets (in the sense defined for stable sets) and such that 
is compact for the Hausdorff metric gives a stable set, since Hausdorff convergence implies 
convergence in the sense we defined for stable sets. 

4.1.3 £-separated unions 

Another example of stable set is the one of e-separated union of elements of a stable class of convex 
sets (see lemma 1551) . For B a stable class of convex sets of a metric space [E, d) such that bounded 
sets are compact, the following set is stable 

Fe ■■= {yjp^gF] Q C F,yF,G e Q inf d{x, y)>e}. 

x&F,yGG 


This sets give an application to classification. 

4.2 Examples of size function 

4.2.1 Packing 

Our first example of size function is functions depending on the packing of sets. Let us recall the 
notion of packing and the more common one of covering (see |KT61j ). 

Definition 27 (t-covering). A set {Bi}i^j of subsets of E is at-covering of a set B if the diameter 
of any Bi does not exceed 2t and 

Bcz\Jb,. 

iel 

The cardinal of the smallest t-covering of B is then called covering number of B for t and is 
denoted N{B^ t). 
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The logarithm t) is sometimes called the metric entropy of B, or its Kolmogorov entropy. 

Definition 28 (t-separated). A set B is t-separated if the distance of every two distinct points of 
B is strictly greater then t. The cardinal of the greatest t-separated subset of B is called packing 
number of B for t and is denoted Ai{B,t). 

The logarithm of A4{B,t) is sometimes called capacity of B. 

These two notions are intuitively linked and carry information on the size of the set. The 
following proposition due to [KT61| compares Af and A4. 

Proposition 29. For any set B of a metric space {E, d), 

M(B,2t)<M{B,t)=M{B,t) < MiB,t)=M{B,t). 

The idea to make the notion of size depends on the packing of the set is not random and comes 
from the fact that packing appears in many notions of size. For instance, the packing measure 
defined in [SRT88| is another definition of the Lebesgue measure (up to some constant factor) and 
is in the more general case of a metric space an isometric measure that coincide with Hausdorff 
measure on spaces with non fractional dimension as shown in [SRT88j . 

We will then study size function t of the form 

t{B)=^M{B,.)), 

for all B G F for a function $ on the set of packing functions. 

In order to ensure that the condition (HSl) is fulfilled, we define the following hypotheses. 

(Him r is strictly increasing (i.e. A G B,A ^ B t(^) < 't{B)), 

(Him for E^riy sequence (Hn)n>i C F, <i)(liminf Af (i?„,.)) < liminf d>(Af (limi?„,.)), 

(H6) t{A) < oo A totally bounded. 

Hypotheses (Him (Him imply respectively (Id) and (him and we can show that, together 
with (I®, they imply (I®, so that we have the following theorem. 

Theorem 30 (Packing size function). Suppose that r is a size function of the form t{B) = 
d>(Af(i?,.)) on a stable class F. Suppose (Hl^, (/Q), (Hf^, and (7®). 

Then, almost surely, 

Haus(5“|5“) ^ 0. 

Remark 31 (On (I®). Since a non compact space can not be arbitrarily close to a finite set 
in Hausdorff distance, then for Btf to converge when {p,n)n>i is sequence of finitely supported 
measures, minimizers relative to p must he compact if F is rich enough to make minimizers finite 
sets. Minimizers relative to p are compact when t{B) < oo implies that B is totally bounded. 

4.3 Examples of sequence of measures 

Proposition 115] and theorem 155] can be applied to the empirical measure and then lead to the 
following. 
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Theorem 32 (Consistency for empirical measure: i.i.d. case). Let {E,d) be a Polish space, ji a 
probability measure on {E,M(E)) and T a stable set such that t“ < oo, for any 0 < a < 1. Given 
a sample {Xn)n>i of independent random variables with same law p,, set 

h-n = - 'Y] Sxi, 

\<i<n 


the empirical measure. Suppose (m, (m, (iw and Then, almost surely, 

Haus(5“|5“) 0. 


Likewise, corollary leads to convergence of empirical measure for some dependent cases. 

Theorem 33 (Consistency for empirical measure: dependent case). Let {E,d) be a Polish space 
and fi be a probability measure on {E,'E>{E)) such that r“ < oo, for any 0 < a < 1. Given random 
variables {Xn)n>i from an ergodic Markov chain with invariant measure p,, denote 



l<i<n 


the empirical measure. Suppose (UBi), (I{3j), (I^, and Then, almost surely, 

Haus(5“|5“) ^ 0. 


5 Proofs 

The following lemma is the starting point of the existence of minimizer theorem. It is also a key 
lemma to most of the other results. 

Lemma 34. Let {E,d) be a Polish space. Let {pn)n>i be a sequence of probability measures weakly 
converging to poo- Then, 

Supp Poo C lim. Supp pn. 

n 

Moreover, for any t > 0 

M(Supp Poo,t) < M.i\vniSupp pn,t) < limmiM.{Supp Pn,t)- 

n n 

Proof of lemmal^ Set t > 0 and x € Supp poo. Then, for all e > 0, there exists rj > 0 such that 
Poo{B{x,e/2)) > rj. Then, using the portmanteau theorem, for n large enough, pn{B{x,e/2)) > rj 
and thus, x G (Supp pnY for any n large enough, which proves the hrst point and first inequality. 

For the second inequality, choose m < AI(lim„Supp pn,t). Let {xi}i<i<m be a t-separated 
subset of lim„ Supp pn. Then, there exists <5 > 0 such that 


inf d(xi,Xj) > t + 5. 

From definition of lim„ Supp pn and since m is finite, there exists k > 1 such that {xi)i>i>m C 
nn>fe(Supp PnY^'^ 
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Choose now x" G B{xi,6/2) fl Supp fin for 1 < i < m. {x"}i<i<m forms a f-separated set 
included in Supp /i„ since for i ^ j 

d(x”, x") > d{xi, Xj) — d(xi, x”) — d(xj, x") 

>t + 5 - 5/2- 5/2 = 1. 

Thus, m < liminf„ Supp fin,t), for any m < A^(lim„ Supp fin,t), which ends the proof. □ 

Proof of theorem \ll\ Let {Bn)n>i C be a minimizing sequence such that fi{Bn) > I — a for all 
n > 1. Set fin = fi{.\Bn). Let first show that {fin)n>i is tight. Let iL be a compact set such that 
fi{K) > 1 — e(l — a), then 


fln{K) = fl{K n Bn)/fl{Bn) 

= {l-fi{K^UB/,))/fi{Bn) 

> 1 - fl{K^)/fl{Bn) 

> 1 — e. 

Thus, up to a subsequence, {fin)n>i weakly converges to fioo- Set B = lim„ Supp fin- B^ is an open 
set and contains Supp /ioo from lemma |34l Then, portmanteau theorem gives 

1 = Moo(5") ^ = fliB^ n Bn)/fl{Bn) 

<fi[B^)l{l-a). 

Letting e tends to zero and using ne>o ~ ff comes fi{B) > 1 — a. 

Proof ends using (I^ and stability of P. □ 

Proof of proposition [7^ Clearly, a i— t“ decreasing, and we thus only need to show r“ > lim£_>o+ . 
Set Ba G argmin{T(A); A € P,fi{A) > 1 — a} and fp = fi{.\Ba). For any (a„)n>i C (0; 1) de¬ 
creasing and converging to a, {fi°‘"-)n>i is tight. Indeed, for a compact set K such that fi{K) > 

1 - e(l - ai). 


fi^-{K)= fi{KyjBn,^)/fi{B^„) 

> {l-fi{K^)-fi{BlJ/fi{B^^) 

= l-fi{K-)/fi{B^^) 

> 1 - e. 

Thus, up to a subsequence, converges to some probability measure fioo- Set Boo = lim„ Supp fin- 
B%o is an open set that contains Supp fioo, and thus portmanteau theorem yields, 

1 = A^oo(i?L) 

=d{B‘oo^Bo^)/fi{Bo,J 

<m(S^)/(1 - an) —^ fi{Blo)l{l - a). 

Letting e tends to zero shows fi{Boo) > 1 — a. Hypothesis (tO]) and stability of P let us conclude 

r“ < t{Boo) < liminf = limT(i?Q^). 

n n 

□ 
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In order to show continuity of the size function fproposition 1161) . we establish a lemma that 
states that for a connected measure, any e-neighborhood of Borel set has strictly more mass than 
the original Borel set. 

Lemma 35. Let fi be a probability measure with a connected support, then for any Borel set A such 
that p{A) < 1 and any e > 0, the e-neighborhood of A satisfies 

Pi{A-) > pi{A). 

Proof of lemmaW^ In order to show a contradiction, suppose yL{A^) = p{A) for some e > 0. Then, 
^(A^/3) = ^[A) and pL{{{A^Yf/^) > pl{{AYY = 1 - so that pl{A^/^ U ^ 

^e/3 n ((A^)'=)^/3 = 0 and A®/3 and ((A®)‘=)^/3 are closed, the support of /i not connected, which 
contradicts the hypothesis. □ 


We can now prove continuity of the smallest size function. 

Proof of vrovosition \lb\ By proposition [TT] it is enough to show that a i—>■ t“ is left continuous. 
Moreover, since a i—>■ t“ is decreasing, it is enough to prove lime_>o+ < t“. 

Let a > 0 and B G P such that r“ = t{B). If p,{B) > I — a then the result is obvious. Suppose 
then p{B) = 1 — a. By lemma ESI for any (5 > 0 there exists e > 0 such that fi{B^) > 1 — a e. 
Define a decreasing sequence {Sn)n>i converging to zero, such that fi{B^’^) > 1 — a for any n > 1. 
Since /i is J^-regular, there exists {Kn)n>i C P such that for any n > 1 

t{Ki) < oo, since < oo, 

1 G; 1 

Clearly, —>■ a. Then, using (HI]) and (HD), it yields 

lim ^ < T(iL„) ^ rinM = t“. 

E^0+ 

□ 

Proof of proposition^!^ Since t“ is hnite, for any 0 < a < 1, there exists B G P such that 
t{B) < oo and yL{B) > 1 — a. For such B, the law of large number states that almost surely 
lim„ Hn{B) = P^{B). Set 5 > 0. Then for any C G P such that B C C and pl{B) < liminf„ pn{C), 
for A := B G P the three following conditions are clearly true 


p{B) < liminf 

n 

B c Ac B^ nc, 

t{A) < oo. 


□ 

Proof of theorem [¥J[ Let iL be a compact set such that Pn{K) > 1 — e(l — ot) for any n > 1 and 
p{K) > 1 — e(l — a). Then, 

> 1 - er, 
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showing tightness of (/J.“)ra>i- Denote a limit measure and = lim„ Supp /j,“. Set e = 
inf {77 > 0; Hn{B) < + ? 7 , VS G B(S)} the Prokhorov distance between 77 and Then using 

together the portmanteau theorem and the fact that the topology induced by the Prokhorov metric 
coincides with the weak convergence topology, for any 77 > 0 there exists Urj such that for all n > nr; 
- since Supp C from lemma [34l 

= Mn((S“rnS“)/77„(S“)+77 

< fj,n{iB^y )/(1 — a) + 77 

< + e)/(l - a) + 77 —> /i(S“ )/(l - a). 

Thus, fJ-{B^) >1 — 0 ! and thus by definition of t“, 

< r(S“ ). 

The second inequality is a direct application of (USD. 

Let us relabel a subsequence of {fin)n>i so that limsup„_,,oo r(S“) = lim„_>oo''"(.S"). Choose 
£ > 0, and Bg G argmin{r(yl); A G > 1 — a + £}. Without loss of generality, we can 

suppose that T(Bg) < 00 . Since the sequence (fin)n>i converges to /i, the portmanteau theorem 
states that for any (5 > 0 , 

liminf > fi(Bs). 

n 

Set <5i > 0. Then, since E G J-, r-tightness implies that there exists A\ G T such that 

t{Ai) < 00 , 

B.cA^CBy, 

1 — a + e < fi{By <luniidfin{Ai), a.s.. 

n 

Let {5k)k>i be a decreasing sequence converging to zero. Using recursively the r-tightness 
property of the sequence [^in)n>i, with sets Ak-i-, define a decreasing sequence {Ak)k>i of elements 
of E, such that Ak C B^ for any A: > 1, and 

Be cAkC B^y , 

1 — a -I- £ < n{Be) < liminf fin{Ak) a.s.. 

n 

Then, for any fc > 1, there exists > 1 such that ^n{Ak) > 1 — a for any n > Uk- Then, since 
Ak G E, t{B'^) < T{Ak) for all n > Uk- By construction and since B^ is closed, (IkAk = B^. Using 
(HID and (H2D yields 

'r(B“) < T{Ak) —>k r(nfcAfc) = r(Be), 
which shows that for any £ > 0 , 

limsupr(B“) < t“-U 

n 

The inequality of the five terms is a direct application of proposition 1161 □ 
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Pvoof of thcOTGUl 1 2^ CtlOOSG ^ Sct — f^n\B^ • TtlGOrGIIl El lemma [M] and continuity 
condition (tjl]) of • !->■ r' shows that {^J,n)n>l is totally bounded for the weak convergence topology 
and that 5“ := lim^ i3„, € 5“ where (nk)k>i is a subsequence along which u“ converges. Theorem 
Eland (USD then yield 

(5) 

for the subsequence of (i?ri)n>i- Since (O holds for a subsequence of any subsequence of the original 
sequence taking 5“ € 5“ so that 1/n + B^) > Haus(5“|5“) for any B^ G 5“, 

it follows 

Haus(5“|5“) ^ 0. 

□ 

Proof of vrovosition Choose a sequence (an)n>i converging to a. From continuity of the size 
function, for any n large enough, r“" < oo. Define /i„ = /i(.|S„) for any Bn € Sequence 

ipn)n>i is tight. Indeed, let isT be a compact set such that > 1 — e(l — max„>i q;„), then 

Hn{K) = H{K n Bn)/n{Bn) 

= {l-^l{K^UB^n))/KBn) 

> 1 - ^l{K-)/^^{Bn) 

> 1 — e. 

Set Boo = hmfc Bn^ with (rik)k>i a subsequence along which (/j.„)„>i converges (such subsequence 
exists since (/in)n>i is tight). Set /ioo = hm^ Using similar arguments to those in theorem [TTl 
we can show that /r(i?oo) > 1 — a. Thus hypothesis (tO]) yields 

r“ < t{Boo) < liminf T(i?„) = limr“" = r“, (6) 

where the last equality is due to the continuity condition (HI])- (H5]) implies 

dj^{Boc-, Bn) ^ 0- (7) 

Remark then that corollary El establishes that 5“" is compact for the Hausdorff metric topology, 
with underlying metric dj^. Since © holds for a subsequence of any subsequence of the original 
sequence {Bn)n>i, taking 


Bn G argmax{ min dH{B,A)\A G 5“"}, 

so that dniBn, Boo) > Haus(5“"|5“) for any Boo S 5“ yields 

Haus(5“"|5“) ^ 0. 


□ 

Proof of corollary \ 2, Set (/in)ra>i = (m) n>i in the theorem El and the result is then a direct 
application of (H) from the proof of the proposition. □ 
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Proof of coroUarvlWA In order to prove this result, we show that (H[T]), (Hl3]) and (EJH)) all together 
imply (HU), (H21) and (HSl). Thus, suppose (H[T]), (Hl3]) and (HH]). Then, (HIl) and (HSl) are 
obviously true. 

Now, in order to show (USD, choose a sequence {An)n>i C T such that T(lim„ < oo, and 
limr(A„) = r(lim Then, lemma [36] implies that 

H{\\Ta.Ak\An) 0 . 
k 

Moreover, 

limT(A„) = lim<I>(AI(A„,.)) 

n n 

> $(liminf .) using (H|3]) 

> $(AI(lim A„,.)) using lemma [Ml 

n 

= T(lim A„). 

Thus, since we supposed lim r(A„) = r(lim A„), and because d) is supposed to be strictely increasing 
by (HUD, lemma |53| shows 

lim inf (H„, t) = AI(limH„, t), Vt > 0. 

Lemma 1571 can thus be applied to show that, along a subsequence, 

dniAn^limAk) 0 . 

k 

Since this is true for any subsequence, it holds for the sequence itself, i.e. (HHD holds. 

The result is then an application of theorem \T2\ □ 

Lemma 36. Let {E, d) be a Polish space and {Bn)n>i be a sequence of set of E. Then, for any 
e > 0 such that A4(lmik Bfc, e) < oo, there exists no S N such that for all n > no, 

limHfc C BlP 

k 

Proof of lemma\E^ Set £ > 0 and m = Bk,e). Let {xi}i<i<m be a maximal £-separated 

subset of lim„ Bn- Then, by definition of lim^ B^ and since m is finite, for any d > 0, there exists 
ks such that for all n > ks, 

C B^. 

Maximality of {xi}i<i<m ensures 

limHfcC y B{xi,e) (Z B^^. 

l<2<m 

Proofs ends when choosing S < e. □ 

Lemma 37. Let {E,d) be a metric space. Let {Bn)n>i be a sequence of closed sets such that 
Haus(i3|i3„) 0, for some closed set B. Suppose that for any s > 0, there exists 0 < t < s such 
that 

liminiM{Bn,t) < M{B,t) < oo. (8) 

Then along a (relabeled) subsequence {Bn)n>i, 

diiiBn, B) —>■ 0. 
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Proof of lemma [571 It suffices to show 


Ve > 0, VriQ € N, 3n > no, Bn C (By. 

In order to prove a contradiction, suppose 

3e > 0,3no S N,Vn > no, Bn (f. {By. (9) 


Then, 


3e > 0, 3no £ N, Vn > no, 3x„ £ Bn,'iy £ B, d{xn, y) > £■ 


Set t < e and rn = M{B,t). Since M{B,.) is right continuous, there exists <5 > 0 such that 


m = M{B,f) = A4{B,t + 6). 


Let {zi}i<m be a maximal {t + (I)-separated subset of B. Then, since for any rj > 0 and n large 
enough, 

B C {Bny, 

we can construct {yi}i<m C Bn such that for all 1 < i < m, d{yi, zf) < rj. Thus, for any i y j, 

d{yi,yj) > d{zi,zj) - d{zi,yi) - d{zj,yj) > t + S - 2 t]. 

Choose then y < 5/2f\{e — t) such that {yi}i<m is a t-separated subset of Bn- Then, {x„}U{j/i}i<m 
is also a t-separated subset of since £ Bn and 


d{xn,yi) > d{xn,Zi) - d{zi,yi) > e-r]> t, 

showing thus M{B,t) + l < M{Bn, t), for any n large enough. This contradicts ([5]) and thus shows 
that (in]) is false. □ 

Lemma 38. Let P be a stable class of connected sets of a metric space {E, d) such that bounded sets 
are compact. Suppose that there exists an increasing union {Kk)k>i of balls verifying Uk>iKk = E 
such that for all k > 1 and any F £ P, Kk H F is still connected then 

Pe ■= yjF&gF-,Q <Z P,'iF,G & Q inf d{x,y)>£}, 

x&F.yeG 


is also a stable class. 

Proof. Set {Bn)n>i C RemarklUstates that there exists Boo such that a subsequence of {Bn)n>i 
converges to Boo for the generalized Hausdorff convergence. Suppose that there exist two connected 
components Fi,F 2 C Boo such that 

d{Fi,F 2 ) := mf{d{x,y)-,x G Fi,y G F 2 } < £. 

Then, the hypothesis on E implies that for any ball K for which K HFi y% and iC fl T 2 0, there 
exists Xi G Fir\ K and X2 G F 2 C\ K such that d{Fi fl F 2 fl K) = d{x, y). One can check that if 
X and y belong to the interior of K (which we can assume without loss of generality), it holds 

dniBn^K, Boof^K)^Q. 
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Taking then K G {Kk}k>i large enough so that 


d(Fi n F 2 n K) = rj < e. 


and setting 

5 = d{Kf^Fl,Kf^B^\Fl)^d{Kf^Fl,Kf^B^\Fl) > 0 , 

yields for all n large enough 

F^r\Kc. n iG and i^2 n if C ( 10 ) 

-B„ n if n Ak ( 11 ) 

m shows then that there exist yi,y 2 S Bnr\K such that d{xi,yi) < {{e — r])AS)/3 and d(x 2 , 2 / 2 ) < 
((e — T]) A S)/3 and so 


d{yi,y2) < d{yi,xi) +d{xi,X2) +d{x2,y2) 

< 2{e - r])/3 + y 

<2s/3 + y/3 <e. (12) 

And (fTTll shows that BndKnFf^^ C F^^'^^flAT and BndKdF^^^ C F^^'^dK, since Arni?M"^nA/^^ C 
K n F^^'^ for i = 1, 2 by definition of <5. It follows that yi G B^d K d F^^"^ and y 2 G B^d K d F^^"^ 
are included in two distinct connected components Bn H AT, which contradicts m and hypothesis 
that Bn G Fe¬ 
lt is then easily checked that Fi, F 2 G F. □ 


6 Conclusion 

We have thus defined a new way to localize mass of measure that is consistent for empirical mea¬ 
sures, in Hausdorff metric. It is rather intuitive and applies to any Polish space, including infinite 
dimensional spaces. It thus provides an analogy to level sets in theses spaces. The major drawbacks 
of our methods lie in the computability of the size function for rich classes F and the lack of rate 
of convergence for now. 
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